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Abstract 

We highlight the difFerent uses of the word likelihood that have arisen in statistics and meteorology, 
and make the recommendation that one of these uses should be dropped to prevent confusion and 
misunderstanding. 



1 Introduction 

We discuss the difFerent meanings of the word likelihood as used in classical statistics and meteorology. 
In section 121 we describe how the word is used in classical statistics, and in section Owe describe how it 
has been used in meteorology. In section^ we discuss the differences and express the opinion that one of 
the two uses should be dropped. In section Owe summarize. 



2 Classical statistics definition of likelihood 

Imagine that we have two datasets Xi and y^, for a range of values of i. We might attempt to build a 
statistical model for Xi in terms of y^, or yi in terms of Xi. 

As an example we will consider linear regression, and will consider building a model for yi in terms of Xi. 
The model we will consider can be written in the following equivalent ways: 



= a + (3xi + ae.i (1) 

where 



e, -iV(0,l) 
Ui ~ N{a + I3xi,a'^) 



(2) 
(3) 



or 



p{y,\x,) = N{a + (3x,,a^) (4) 

To fit the parameters (a, /?, a) of this model given the data Xi and yi one would typically consider finding 
those parameters that maximise the likelihood L(a, P,a), where L is defined as: 



L{a, /3, a) = p{y\x, a, /3, a) (5) 

In the case of linear regression, this happens to be equivalent to finding parameters that minimise the 
sum of squared errors. 

We note that the definition of L as L{a, /3, a) = p{y\x, a, /?, a) arises because we are using Xi to model yi 
i.e. X is the 'input', 'independent variable', 'predictor', 'regressor', 'explanatory variable' or 'covariate', 
while y is the 'output', 'dependent variable', 'regressand', 'response variable' or 'predictand'. 
If we had set out to use yi to model Xi (using linear regression, or any other model) then the likelihood 
would have been defined as L{6) = p{x\y,6) where 6 represents the parameters of this new model. We 
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see that the definition of fikehhood is thus entirely dependent on what is being used to model what, and 
what model is being used. 

In general, once we have decided on a model then the likelihood is the probability density (or, for discrete 
rather than continuous models, just the probability) of the predictand given the predictors, as a function 
of the parameters of the model. 

U se of the wor d likelihood in this context comes from the original works of Fisher in the 1920s, such 

as lFisheil(|l922l) . A recent overview of the use of the likelihood in statis tical inference is giv en bv lCasella and Bergeil 
l(2002l) . and likelihood-based fitting of parameters is also discussed in lPress et al.l 1 1992I) . 



2.1 Application to making probabilistic forecasts 

We now consider a meteorological application: making probabilistic forecasts. Probabilistic forecasts 
are made by taking the inherently non-probabilistic output from numerical models (consisting of, for 
example, single integrations, ensemble members, or ensemble means and spreads) and fitting a statistical 
model to them to generate probabilities. In other words, given a non-probabilistic model forecast / we 
make a probabilistic forecast of the observations o. Our forecast can be written as p{o\f) i.e. a probability 
distribution of different possible observations, given the forecast we have available. Note that the notation 
p{o\f) does not yet specify what model is used to convert / to o. 

For temperature, a reasonable way to make a probabilistic forecast is to use linear regression, with the 
input fi being either a single forecast or an ensemble mean. This then means we can write: 

p{o,\f,)^N{a + f]f,,a^) (6) 

If we wish to use information from the mean rrij an d spread Si of an ensemble forecast, then we can use 
the spread regression model of Ijewson et alJ l|2003|) : 

pio,\f,)^N{a + pm,,{-, + 6s,)^) (7) 

The usual way to fit the parameters of either of these models would be to find those parameters that 
maximise the likelihood, defined as L = L{a,(3,a) ~ p{o\f, a, (3, a) (for the regression model) or L — 
L{a,(3,j,5) = p{o\f,a,l3,j,S) (for the spread regression model). The likelihood is defined as p{o\f) 
simply because we are trying to predict the observations o from the forecast /. If, for some reason, we 
wanted to predict the forecast from the observations (it is not immediately obvious why one would want 
to do this, but there may be reasons), then we would define the likelihood as p{f\o). 



3 Murphy and Winkler definition of likelihood 

iM^irphv and Winkleil l|l987l) (henceforth MW) discuss ways in which one can validate probabilistic fore- 
casts, and in particular introduce the following definitions: 

• Pif\o) is the likelihood 

• p{o) is the base rate 

• p{o\f) is the calibration 

• p{f) is the refinement 

Follow ing this paper, a number of other meteorologists (such as lJoUiffe and Stephensoiil l)2003tl and lWilki 

l)200ll) ) have used the word likelihood to refer to p{f\o) and the word calibration to refer to p{o\f). 



4 Discussion 

We see that the MW definition of the word likelihood is subtly different from the original definition as 
used in classical statistics. In particular, MW define likelihood once and for all as p{f\o) irrespective of 
whether o is being modelled in terms of /, or / is being modelled in terms of a. The classical statistics 
definition of likelihood, on the other hand, depends on what is being used to model what. 
This creates some confusion, especially when one tries to apply classical statistical methods to forecast 
calibration as described in section Because of this, we advocate that the MW definition should not 
be used. Our reasons for taking this position are: 



• The classical statistical definition of the word likelihood is the original definition. 

• It is used, and understood, by many thousands of applied mathematicians. 

• It has been in use for over 80 years. 

• The MW definition is a restriction of the original definition. 

• It is only used, and understood, by a very small number of meteorologists involved in the field of 
probabilistic forecast verification. 

• It is very recent. 

5 Summary 

Statisticians have used the phrase like lihood for over 80 year s , with a particular meaning, following ' Fished 
jr922l|) . A relatively recent paper bv iMurphv and Winkled ()l987f) attempts to redefine this word when 
applied to meteorological forecasts and observations. This undermines the original meaning, creates 
confusion, and is not very helpful in building connections between statistics and meteorology. We there- 
fo re strongly advise that mete orologists working in forecast verification sh ould not use the definition 
of iMurphv and Winkled l^sj), and should stick to the original definition of lFished l)l922|) . 
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